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Statistical inference for stochastic processes with time-varying spectral characteristics has re- 
ceived considerable attention in recent decades. We develop a nonparametric test for stationarity 
against the alternative of a smoothly time-varying spectral structure. The test is based on a 
comparison between the sample spectral density calculated locally on a moving window of data 
and a global spectral density estimator based on the whole stretch of observations. Asymptotic 
properties of the nonparametric estimators involved and of the test statistic under the null hy- 
pothesis of stationarity are derived. Power properties under the alternative of a time-varying 
spectral structure are discussed and the behavior of the test for fixed alternatives belonging to 
the locally stationary processes class is investigated. 
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1. Introduction 

Second order stationarity is an important assumption in the statistical analysis of stochas- 
tic processes, allowing for the development of an asymptotic theory capable of investigat- 
ing properties of many classical statistical inference procedures. However, the assumption 
that the second order characteristics of a process remain constant over time is often not 
justified in practice. Many time series show a non-stationary behavior due to smooth 
changes of their second order structure over time. Based on this observation, interest has 
been directed toward the analysis of stochastic processes whose spectral characteristics 
change slowly over time. Priestley (1965) introduced processes with evolutionary spec- 
tra and a time- varying spectral representation similar to that of a stationary sequence; 
cf. also Granger (1964) and Priestley (1988) for an overview. During the last decade, 
statistical inference for processes showing such a non-stationary behavior has attracted 
considerable attention in the literature. In this context, asymptotic statistical inference 
has been made possible by considering arrays of double-indexed processes having time- 
varying spectral characteristics together with a time rescaling approach which allows for 
increasing information on the local structure of the underlying non-stationary process as 
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the sample size increases; see Dahlhaus (1997) and literature on the related concept of lo- 
cally stationary processes. Mallat et al. (1998) considered adaptive covariance estimators 
of locally stationary processes, while Nason et al. (2000) considered processes where the 
time- varying spectral representation and the associated Fourier basis have been replaced 
by a representation with respect to a wavelet basis. The aforementioned developments in 
the statistical analysis of locally stationary processes make it important to have powerful 
statistical tools that detect possible changes in the spectral structure of a time series and 
evaluate the appropriateness of the prcassigncd stationary process class. 

This paper deals with nonparametric tests of stationarity against the alternative of a 
time-varying spectral structure. The idea underlying our approach is to compare local 
sample spectral density estimates based on a moving window of data with a global 
estimate using the whole set of observations, and to evaluate the difference between 
the two quantities over the different frequencies and time segments considered using an 
appropriate ij2-typc distance measure. To be more specific, the proposed test is based 
on a smoothed version of the local sample spectral density (local periodogram) rescaled 
by a global spectral density estimator obtained using the whole stretch of data. For a 
given time point, the corresponding smoothed statistic squared and integrated over all 
frequencies is a measure of deviation between the local spectral structure of the observed 
time series and a global spectral structure associated with the hypothesized stationary 
behavior. Calculating this local quadratic deviation measure for different instants of time 
and building a time-averaged version lead to a global measure of stationarity which is 
used for testing purposes. 

To allow for a rigorous asymptotic investigation of the testing methodology proposed 
under the null and under different non-stationary alternatives, the theoretical framework 
of locally stationary processes is adopted; cf. Dahlhaus (1997). The asymptotic behavior 
of the test statistic under the null of stationarity is derived and its weak convergence to a 
Gaussian distribution is established. Consistency of the test is proved and its asymptotic 
power behavior is investigated for fixed alternatives belonging to the class of locally 
stationary processes. Although some of the results obtained under the null of stationarity 
can be derived under different assumptions on the underlying stochastic process, the 
aforementioned theoretical context of locally stationary processes is preferred because it 
delivers a unified set-up for asymptotic investigations of the test proposed both under 
the null and under fixed alternatives. 

Procedures for testing the constancy of particular characteristics of a process over time 
have been considered in the literature under various settings and approaches with their 
main focus directed toward testing the constancy of specific parameters or characteristics 
in a more or less restrictive parametric set-up. The more general nonparametric problem 
of testing the constancy of the entire spectral structure of a process without imposing 
any parametric assumptions on the underlying process class has attracted less interest 
in the literature and only a few studies of this exist. 

Priestley and Rao (1969) proposed testing the homogeneity of a set of evolutionary 
spectra evaluated at different instants of time using logarithmic transformations and an 
analysis of variance framework. For Gaussian processes and for the particular purpose 
of a change-point detection, Picard (1985) developed a test based on the difference be- 
tween (possible time- varying) spectral distribution functions estimated on difi'erent parts 
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of the series and evaluated using a supremum-type statistic. Giraitis and Leipus (1992) 
generalized this approach to the case of linear processes, von Sachs and Neumann (2000) 
developed a test of stationarity based on empirical wavelet coefficients estimated us- 
ing localized versions of the periodogram. In a more restrictive context, Sakiyama and 
Taniguchi (2003) considered testing stationarity versus local stationarity in a parametric 
set-up. Fitting piecewise autoregressive processes has been also used to detect change- 
points in the context of time series; Kitagawa and Akaikc (1978) used this approach 
together with AlC-based order selection, while Davis et al. (2005) suggested fitting piece- 
wise autoregressive models using a minimum description length procedure together with 
a generic algorithm to solve the difficult optimization problem. 

Compared to the last two approaches mentioned, the method proposed in this paper 
is entirely nonparametric in that it does not rely on distributional or parametric as- 
sumptions on the underlying process class and it is not only focused on the detection 
of change-points but also on smooth changes of the spectral structure. Unlike Picard's 
(1985) proposal focused on change point detection and which is based on the integrated 
periodogram and supremum-type statistics, our method is based on smooth (kernel) es- 
timates of the (possibly) time- varying spectral density function of the underlying process 
and uses L2-type distance measures. Compared to the rather heuristic derivations of 
Priestley and Rao (1969), our set-up allows for a thorough theoretical investigation of 
the asymptotic properties of the proposed test both under the null and under fixed al- 
ternatives. In addition to the aforementioned power investigations, our testing approach 
differs from that of von Sachs and Neumann (2000) in that it uses global measures of 
deviations from stationarity, thereby avoiding a multiple testing procedure based on a 
selected set of estimated wavelet coefficients with conservative critical values obtained 
via Bonferonni's inequality. 

The paper is organized as follows. Section 2 contains the main assumptions imposed 
on the underlying stochastic process class and introduces the basic statistics used in the 
sequel. In Section 3, the statistic proposed to test the null hypothesis is presented and 
its asymptotic behavior under the null of stationarity is investigated. Section 4 deals 
with the power properties of the test and investigates its behavior for fixed alternatives. 
Section 5 contains some practical guidelines on how to choose the testing parameters 
and demonstrates the capability of our method to detect departures from stationarity by 
means of a numerical example. Proofs of the main results are deferred to Section 6. 

2. Basic statistics 

2.1. Preliminary considerations 

To provide a theoretical set-up allowing for stochastic processes showing time-varying 
spectral characteristics and to state rigorously the null and alternative hypothesis of 
interest, we adopt the framework of locally stationary process (see Dahlhaus (1997) and 
Dahlhaus and Polonik (2003)) which is flexible enough and covers several interesting 
cases. Since we restrict ourselves to questions related to the temporal behavior of the 
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spectral structure of a process, it seems natural to consider the general class of time- 
varying linear processes as the appropriate underlying class of processes. 

Consider a triangular array {X„, n £ N} of stochastic processes, where X„ = {Xt^n, t = 
1, 2, . . . , n} satisfies 

Assumption 1. 

(i) Xt^n has the representation 



Xt,n= ^ at,n{j)et-j, 



(2.1) 



where the {st} are i.i.d. with E{et) = 0, E{£l) = 1, E{e\) < oo and K4 = 77 — 3, where 
ri = E{ei); 

(ii) supj |at_„(j)| < Kl ^{j), where K is a non-negative constant independent of n 
and the positive sequence G Z} satisfies X]jl-oo \j\^~^U) < 00/ 

(iii) functions a{-,j) : (0, 1] — >■ M with 



sup 

tiG[0,l] 



< 



K 



exist such that 



sup 

l<t<n.neN 



< 



K 



(2.2) 



(2.3) 



Let At_„(A) = at, n(j) exp{—iAt} and denote by f{u,X) the time-varying spec- 



tral density /(u,A) = {2n)-^\A{u, X)]"^ , where A{u,X) = X]' 



i(m, j)exp{— ijA}. Note 



that by the smoothness properties of a(-, j), the triangular array {X„,n € N} uniquely 
determines the local spectral density /(?i,A), and that (2.3) implies 



sup 



^At,„(A)^t,„(A)-/(t/n,A) 



O(n-i). 



(2.4) 



Let Tls be the class of stochastic processes satisfying Assumption 1 and let Ts, J^s C 
^LS, be the subclass of stationary processes, that is, 

{00 00 
X„ :Xt,„ = a{j)et-j Vi and n e N and ^ b1|a(j)| < 00 i. 
j — — OQ j — — OQ J 

Given observations Xi_„, X2,„, . . . , X„_„, the problem considered is that of testing the 
hypothesis that the spectral structure of the underlying process is constant over time. To 
precisely define the corresponding null and alternative hypotheses, consider for A e [— tt, tt] 
the time-averaged local spectral density g{X) defined by 



5(A) 



/(w, A)du. 
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By the smoothness properties of /(-jA), 51(A) is well defined and has all properties in 
common with the spectral density of a real-valued stationary process, that is, it is sym- 
metric, non-negative definite and satisfies j^^g{\) dA < 00. Furthermore, /(w, •) = g{-) if 
and only if for every A £ [— tt, 7t], f{u,X) is an a.e. constant function of the time variable 
uG [0, 1]. We require that g also satisfies the following assumption. 



Assumption 2. 



inf g(A)>0. (2.5) 

Ae -7t,7t 



Based on the previous considerations, the testing problem investigated is described by 
the following pair of null and alternative hypotheses 

-ffo :/(-",-)=. 9(0 a.e. 

vs. (2.6) 
Hi:X{{u:f{u,-)^g{-),ue[0,l]})>0, 

where X{A) denotes the Lebesgue measure of the set A G B{[0, 1]). 

Let /„(A), A £ [— Tt, 7t] be the periodogram calculated using the whole time series, that 

is, 

2 



t=i 

and let be a kernel function /\ : M — > M satisfying the following assumption. 

Assumption 3. K is Lipschitz continuous, bounded and symmetric with support [— tt, tt] 
such that {2n)^^ J^^K{x)dx = l. 

Consider the smoothed periodogram 

where ujj ~ 2nj/n, j = 0, ±1, ±2, . . . , Kh{-) = h-^K{-/h) is the scaled kernel and h>0 
the bandwidth. It is well known that if the null hypothesis Hq is true, then 

sup |5^(A)-/(A)|^0, (2.8) 

Ae[-7t,7t] 

in probability, where /(A) = {2n)-^a^\A{c-'^)\^ , ^(0"'^) = X^^l-oo "0') Gxp{-ijA}, de- 
notes the time-invariant spectral density of the underlying stationary process. The above 
result can also be extended to the case where the alternative Hi is true, that is, where X„ 
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is locally stationary with a time- varying spectral structure in the sense of Assumption 1 . 
In particular, as Lemma 6.1 shows, we have, in this case. 



sup 

Ae[-7t,7tl 



9hW - / ./(w, A)du 



(2.9) 



in probability, that is, for locally stationary processes, 5/1 (A) is a uniformly consistent 
estimator of the time-averaged local spectral density g(X). 



2.2. Smoothed rescaled local periodogram 



To construct out test statistic, let m„ be a positive integer such that < m„ < n and 
consider for A G [— tt, tt] and u G (0, 1) the local periodogram Im.„ (w. A) calculated using a 
subset of m„ tapered observations around location u, that is. 



2txHo 



t=i 



t,m„^t+[un]-M„-l,n'^ 



-iXt 



where M„ = mn/2 and u G (0, 1) such that M„ -I- 1 < [un] <n — A/„ + 1. 
Furthermore, ht,n = h(t/n) is a taper function and 

n 

HuA>^)=Y,h{slnfe-'^' 

s=l 

its Fourier transform. Write Hn{-) for i?i.„(-). 



Assumption 4- The taper function /i:M— > [0,1] is of bounded variation and vanishes 
outside the interval [0, 1]. 

Note that the use of taper reduces the bias of the local periodogram due to the well- 
known leakage effect and it simultaneously reduces the bias due to the (possible) non- 
stationarity of the series. For the asymptotic theory developed in this paper, the use of 
such a taper is necessary in order to control the bias of the local periodogram, under 
both the null and the alternative. Note that we do not use a data taper to calculate the 
global periodogram /„(A) based on the whole stretch of observations mainly because we 
would then get 

n 

EiUXk)) = F2,„(0)-i h\t/n)flt/n, Xu) + 0{n-^ log(n)), 
t=i 

which implies that under the assumptions made, the smoothed global periodogram g/i(A) 
does not converge under the alternative to the desired limit g{X) = /(u,A)dw. 
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Now, the basic statistic used in the sequel is the following kernel-smoothed version of 
the rescaled local periodogram 

where Aj = 27Tj7m„, = 0,±1,±2, To understand the motivation leading to A), 

consider 

A) = ^ 5: K,(\ A,) l"^!-!^ - i) , (2.11) 

which differs from T4,(u,A) by the fact that the true function g[X) is used. Under the 
assumptions made, and if the null hypothesis is true, then y„(u,A) is a nonparametric 
(kernel) estimator of the mean function E{Im„{u, X) / f {X) — 1) = 0(m~^) that converges 
to zero uniformly in u and A as m„ — >■ oo. Thus, under the null hypothesis, we have, for 
all u e (0, 1) and A <E [— 7t, tt], that 

y„(w,A)^0 

in probability. On the other hand, if the alternative is true, that is, if the spectral structure 
of the underlying process varies over time, then it follows using Lemma 6.1, that 

^"^"'^^^(/o/LA)d."0 

in probability, where the limiting function on the right-hand side is different from the zero 
function because f{u,-) = f{u, ■) du if and only if f{u,-) is an a.e. constant function 
of the time parameter u G [0,1]. Thus, under the null hypothesis of stationarity, we 
expect Vn{-,-) to be close to the zero function, while this will not be the case under the 
alternative. 

Note that for a given u £ (0, 1), the statistic (2.10) has the same asymptotic behavior 
as the statistic Vn{-) in which the true function g appears. This is true because (2.10) 
is based on observations within a window of length to„ while gh is calculated using the 
whole stretch of n data points. In fact, Vn{u, A) = Vn{u, A) -I- Op(sup;^^. \9h{Xj) — g{Xj)\). 
For statistics based on time-averaged versions of (2.10), such as those considered in the 
sequel, we show that under appropriate conditions on the behavior of the smoothing pa- 
rameters involved, the effects of using the nonparametric estimator g^ are asymptotically 
negligible. 
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3. Testing the null hypothesis 
3.1. The test statistic 

To make the aforementioned behavior of the statistic Vn{u,X) useful for testing the 
nuU hypothesis of stationarity, let < mi < M2 < • • • < un < 1 be a set of = N{n) € N 
distinct points in the interval (0, 1). Set uq = and let dj^n = % — uj^i , j ~ 1,2, . . . , N he 
the distance between two consecutive points. Taking equidistant points, that is, dj^n = 
dn = {N + 1)'-^, a choice of Uj could be Uj = tj/n, where tj = S{j — 1) + m„/2 for 
j = 1,2, . . . , N . The positive integer S ~ S{n) denotes the shift from segment to segment 
while n — S{N — 1) + m„. Using a squared deviation criterion, a useful approach to test 
the null hypothesis is to average over the different time points and over the different 
frequencies the squared statistic Vn{ts/n,X). This leads to the test statistic 

considered in the sequel. Note that due to averaging over N distinct time points, T„ is 
an overall measure of deviation of the local spectral structure of the process from its 
hypothesized global behavior under the null hypothesis. 

To investigate the asymptotic behavior of T„ under the null of stationarity, the follow- 
ing assumptions are imposed. 

Assumption 5. As n^oo, 

(i) h^n-^ for some 3/20 <X<l/3; 

(ii) b^m-^; 

(iii) Sn = [nin/c], where c is a fixed positive integer such that 1 < c < m„; 

(iv) TO„ ^ for some 5 G {5i,52), where 5i = max{l/(3 — A), A/(l — A)} and 82 = 
min{(8A - 1)/(1 - A), (1 - 2A)/(1 - A)}. 

Some comments concerning the above assumptions are in order. Part (i) and (ii) ensure 
consistency of the corresponding global and local spectral estimators. For simplicity of 
derivations, we allow both smoothing bandwidths h and b to converge to zero at the 
same rate, although different rates arc also possible. Part (iii) puts some conditions on 
the behavior of the shift Sn between the different segments used to calculate the statistic 
Vn (us , A) . The condition m„ > Sn seems reasonable since it makes no sense to omit 
data (m„ < Sn)- Although the requirement for a fixed 1 < c < m„ is not restrictive in 
practice, it is necessary from a technical point of view in order to control the degree 
of overlapping between the segments used. Part (iv) implies that m„/n — > as n ~^ 00, 
from which it follows, because of m„ = cSn and NSn ~ 0(n), that the number N of 
time points Uj — tj/n at which Vn{uj,X) is calculated increases to infinity as n — >■ 00. 
This is important for a good power behavior of the test since, in this case, the (rescaled) 
distance c?„ = Sn/n between the time points tj/n in the interval (0,1) over which the 
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behavior of Vn{tj/n,X) is evaluated goes to zero as n— >■ oo; cf. Section 4. However, part 
(iv) introduces some restrictions regarding the rates at which the time window to„ and, 
consequently, the number of time points N, increase to infinity as the sample size n 
increases. Essentially, these conditions prevent both m„ and N from increasing too fast 
with respect to n and seem necessary in order to make the effects of estimating the time- 
integrated spectral density g on the distribution of the test statistic T„ asymptotically 
negligible. 

Theorem 3.1. Let Hq be true and Assumptions 1-5 be satisfied. As n— >-oo, 

m„V7V6T„-/x„^AA(0,T2), 



where 



fin = ^/_ff 
9 2 



J K^ix)dx + y/Nb(^-^J {K ^ K){y)dy + 2nK. 



Mt. 



H 



7t^ 



HZ 



E 



27t 



2n 



h^ix^h^ix+lsl/c) dx 



l[OS-\s\/c] 

Mh = H^l Hk = Jq h^{u) du and K * K{-) denotes convolution of the kernel K . 

The fact that the centering sequence in the above theorem depends on the fourth 
order cumulant K4 of the error process is due to time-averaging the weak and 0(to~^)- 
vanishing covariance of the local periodogram ordinates at different Fourier frequencies. 
To implement the statistic T„ in practice, a consistent estimator of K4 is needed. Such a 
nonparametric estimator is given by 



Ki^(^ng2M{0)-An j ?^(A)dA^ j (^j gh{\)d\ 



(3.2) 



where g2,h{-) is the estimated spectral density of the squared process X^„; cf. Janas 
and Dahlhaus (1994) and Grenander and Rosenblatt (1956). Based on Theorem 3.1, 
an asymptotically a-level test, a € (0, 1), is obtained by rejecting the null hypothesis if 
mnV NbTn > + '''^a , whcrc Za dcnotcs the upper a-percentile of the standard Gaussian 
distribution and /2„ is the estimator of fin obtained by replacing K4 by (3.2). 



4. Power properties 

We begin our power investigations with the following theorem which establishes consis- 
tency of the proposed test. 
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Theorem 4.1. Suppose that the triangular array {X„,nG N} possesses a local spectral 
density f{u,X), /(•,•) G L2([0, 1] x [— 7t,7T]). Assume that the set {u:uG [0, 1], /(m, A) ^ 
g(A)} C [0, 1] has positive Lebesgue measure and that the following conditions are satisfied: 



5/i(A) — > g{X) in probability, limsup / g/i(A)dA< / .g(A)dA 



and 



/ ™« ^ X! ^''^^ ~ ^j)Im„ (it, Aj) - /(it, A) 



dA = Op(r„) 



uniformly in u, where rn is a zero sequence. Then, as n ^ oo, 



r„ ^ 



.9(A) 



1 dAdw 



in probability. 



Notice that the above conditions arc satisfied if, for instance, the process under the 
alternative is a locally stationary process in the sense of Assumption 1. Observe fur- 
ther that the limit is an L2-measure of deviation of the locally stationary process 
with time- varying local spectral density f{u, A) from the null, that is, from a station- 
ary process with spectral density g{X) ~ f{u, A) du. Recall that g{X) is a proper 
spectral density and note that g{X) can be interpreted as the time-invariant spectral 
density which best approximates the time-varying local spectral density f{u,X) in the 
sense that for each frequency Ae [— 7t, 7t], the function g{X) = f{u,X)du satisfies 
g{X) = argmin^ Joifi'^j A) — c)^ du. Positive values of are tantamount to a deviation 
of the locally stationary process {X„} from its best approximating "stationary coun- 
terpart" , where the latter can be understood as a zero-mean stationary process with 
autocovariance function 7(-) : Z R given by 'y{h) = J^^giX) exp{i/iA} dA. 



4.1. Locally stationary alternatives 

Although Theorem 4.1 provides some useful insights into the consistency properties of 
the proposed test, an asymptotic analysis of its behavior for fixed alternatives is very 
informative and important since it leads to useful approximations of its power function 
and of the probability of the type II error. In the following, we investigate the power 
behavior of the test for fixed alternatives that belong to the class of locally stationary 
process. For this, the following assumption is imposed. 

Assumption 6. As n-^oo, 

(i) h^n~^ for some 1/4 < A < 1/3; 

(ii) 6~m-^; 
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(iii) Sn = [nin/c], where c is a fixed integer such that 1 < c < to„; 

(iv) m„-n'^ for some 6, 1/2 < (5 < min{(8A - 1)/(1 - A), (1 - 2A)/(1 - A)}. 

The lower bound in part (i) above implies that the bandwidth used to obtain the 
nonparametric estimators of the time- integrated function g{X) and of the rescaled lo- 
cal spectral density f{u,X)/g{X) is relative small, that is, the bias of the corresponding 
estimators is of a smaller order of magnitude than their variance. Relaxing this assump- 
tion to 3/20 <A< 1/3 (cf. Assumption 5) will introduce an additional bias term to the 
centering sequence of the asymptotic distribution given in Theorem 4.2 below. 

Theorem 4.2. Suppose that X„ G J-^s \ ^s, that Assumptions 1-4 and Assumption 6 
are satisfied and that E\£i\^ < oo for all k eN. If n ^ oo, then 



where 



V^iT„ - bl - Dl) ^ AA(G, 

1 r..2..^. r/'KA) 



2TOn„ b 



Mh I K^{x)Ax f f l^^dXdu, 

J-n Jo J-n 5 (A) 



and = 87z{vi + fjV2 ~ 2^3), with 

vi^2 f v^{X)g^X)dX+ f I v{Xi)v{X2)g4{Xi,-Xi,X2)dXidX2 



w{u, A)/(it, A) dA 



J — 7t J — 7X •J — 7T 

X)f^{u, A) dAdw + K4 

Jq J-n 

V3 = 2 [ [ v{X)w{u,X)f'^{u,X)dXdu 
Jq J-n 

+ K4 / u(Ai)w(it,A2)/(w, Ai)/(m, A2)dAidA2dM, 



du. 



1 f f{u,X) \ fiu,X) f f{u,X) \ 



H = H^/Hl = H^^Y.\s\<cS[os-\s\/c]h''{^)h\x + \s\/c)dx, 54(Ai,A2,A3) = 

/g /4(M,Ai,A2,A3)du and fi{u,Xi,X2,X3) = KiA{u,Xi)A{u,X2)A{u,Xj,)A{u,Yjl=i K) 

the fourth order local cumulant spectrum. 

By the above theorem, asymptotic normality of the test statistic T„ still holds under 
fixed alternatives, but with a different rate of convergence than under the null of stationar- 
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ity. In fact, while under the null hypothesis the variance of T„ is of order 0(m~^-/V"^6^^), 
under fixed alternatives, the variance of the test statistic is of order 0{m~^N~^). 

Theorem 4.2 is useful for calculating, by means of numerical integration, the asymp- 
totic power of the test for fixed alternatives belonging to the locally stationary process 
class considered. This is important if, for instance, one is interested in quantifying the 
asymptotic probability of the type II error associated with using a stationary model. In 
particular, if /(u,A) is the true local spectral density, then the probability of rejecting 
the null hypothesis is approximately equal to 



P{Ho is rejected) ~ 1 



« 1 



« 1 

where v = Vv^, with v^, d"^ and as given in Theorem 4.2 and fin and t as given in The- 
orem 3.1. The second approximation above follows using fin/\/'m„b — y/rrinNS^ = \/N / 
{y^b)MH{2n)-^D^ + 0{N^^^mn^^'^). Notice that the power of the test is dominated 
by the term which is a discrete-time version of the quadratic deviation given in 
Theorem 4.1 and that •^/irinN goes to infinity at a y^-rate as ti — > oo. 

5. Applications 

5.1. Some guidelines for choosing the testing parameters 

While the results of the proposed testing procedure seem to be less sensitive with respect 
to the particular choice of the taper function h{-) and the kernel K{-), the choice of the 
other tuning parameters is more influential and more difhcult to solve from a theoretical 
point of view. Based on some rather heuristic considerations, in this section, we provide 
some guidelines on how to choose these parameters in practice. 

Concerning the smoothing bandwidths h and b, one approach is to choose these pa- 
rameters in a way which leads to "good estimates" of the unknown functions g{X) and 
f{u, X)/g{X), respectively. A procedure for selecting the smoothing bandwidth in the con- 
text of spectral density estimation for stationary process is offered by the cross-validation 
criterion proposed by Beltrao and Bloomfield (1987); see also Robinson (1991). Adapted 
to our context and under the null hypothesis, such a rule implies that h can be chosen 
as the minimizer of the objective function 

cy(h) = ±Y^Lg{g-Hco,)) + i^], 



-4U^~V^Dl-'4^L^n-\\ (4.1) 
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where g ^ {ujj ) is the leave-out-j version of the estimator 'gh [ojj ) given by g ^ {ojj ) = n ^ x 
J2seL„ j Kh{ujj - Ws)/„(a;s) and L„j = {s : -L„ < s < i„ and j - s ^ ±j inodL„}. Al- 
though the above approach modified appropriately can also be applied to select the 
bandwidth b used within each segment, a computationally simpler rule can be obtained 
via the following considerations. Suppose that the global bandwidth h behaves under the 
null of stationarity like h ~ Cn^^, where A is known and the constant C depends on char- 
acteristics of the unknown function g{X) = f{u, A). Suppose, further, that the bandwidth 
b has a similar behavior to h, but adjusted for the smaller number of observations, that 
is, b~ Cm^^^ . Then, for h chosen, b can be selected according to the rule 6 = /i(n/m„)'^. 

Regarding the choice of the segment length m„ and the number of time points N used, 
Assumption 5(iv) and Assumption 6(iv) impose some restrictions on the rates at which 
these parameters have to increase to infinity. Notice that the number of time points N 
is automatically determined by the choice of the segment length to„ and of the shift 
Sn, that is, of the constant c. For instance, for A = 1/4, 5 is allowed to vary in the 
interval (1/2,2/3). Within the allowed range of possible values for 5, we stress the fact 
that 5 should be chosen on the one hand large enough in order to have within each time 
segment a sufficient number of observations for nonparametric estimation. On the other 
hand, 5 should not chosen too large in order to have sufficient information on the local 
structure of the series, that is, a sufficient number of N distinct time points. Recall that 
c determines the degree of overlapping of the segments used and that this parameter 
should be kept rather small, that is, 1 < c < 3. 



5.2. A real-life data example 

We demonstrate the capability of our method to detect changes of the spectral structure 
of a time series by means of a real-life data example. The data analyzed consists of the 
first 3072 observations of a set of tremor data recorded in the Cognitive Neuroscience 
Laboratory of the University of Quebec at Montreal, the purpose of the study being 
to compare different regions of tremor activity coming from a subject with Parkinson's 
disease. It has been previously analyzed by von Sachs and Neumann (2000). A plot of the 
first differences of the series is shown in Figure 1 (a) after a Gaussian noise of standard 
deviation 0.01 has been added in order to break the discrete nature of the data. 

We apply the test proposed using a local window of length to„ — 256 observations and 
a shift Sn = m„ (c = 1), that is, we consider = 12 time points. To smooth the global 
and local periodogram, the Bartlett-Priestley kernel has been used with bandwidths 
h = 0.1 and 6 = 0.18, where h has been chosen as the minimizer of CV{h). For this 
choice of parameters, we get {rrinV NbTn — 'p"n)/T ~ 21.16, which, compared with the 5% 
critical value of the standard Gaussian distribution, clearly leads to a rejection of the 
null hypothesis. 

To better understand the reasons leading to the above rejection of stationarity, it is 
useful to consider the statistic Qn{u) defined by 

/•TT 

Qn{u)^mnVb V,^{u,X)dX, ue(0,l), (5.1) 
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which evaluates, at any time point u G (0, 1), the difference between the locahy calculated 
sample spectral density A) and its globally estimated counterpart ghW- Notice 

that the test statistic r„ is just a time-averaged version of Qn{u) calculated for N 
different instants of time. Validity of the null hypothesis corresponds to small values of 
Qn{u) while large values of the same statistic pinpoint time regions where changes in 
the spectral structure of the series occur. Figure 1(c) shows a plot of Qn{t/n) against 
the rescaled time variable t/n. As this figure clearly indicates, the main contributions to 
the test statistic T„ leading to the rejection of the null hypothesis are coming essentially 
from two different time segments. Based on the behavior of Qn{ts/n), these segments are 
easily identified as follows. The first segment starts approximately at t = 1760 and ends at 
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(c) (d) 

Figure 1. (a) Plot of the first differences of the tremor series (n = 3071). (b) Estimated 
spectral densities for different segments of the tremor series: the solid line refers to the es- 
timated time-averaged spectral density Ph(A), the dashed line to the estimated spectral den- 
sity for the time segment t£ {1760, 1761, 2170} and the dotted line for the time segment 
t G {2350, 2351, . . . , 2840}. (c) Plot of the statistic Q„{us) against the rescaled time parameter 
Ua = ts/3071, ts = 129, 130, . . . ,2942. (d) Estimated spectral density of the first time segment 
consisting of the observations Xt for t € {1,2, ... , 1759}. 
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t = 2170, while the second starts at approxnTiately t = 2350 and ends at t = 2840. Figure 
1(b) shows the estimated time-averaged spectral density g/i(A), as well as the estimated 
spectral densities obtained using observations within each one of the two identified time 
segments. Figure 1(d) shows the estimated spectral density obtained using observations 
of the first part of the series. As is clearly seen from these exhibits, an increase in the 
variability of the series, together with a reallocation of the total power toward low- 
frequency components, makes the behavior of the spectral densities in the two identified 
time segments different from the overall behavior and from the behavior during the first 
part of the series. 

6. Auxiliary results and proofs 

Lemma 6.1. Suppose Assumptions 1 and 3 hold. As >oo, 

(i) E{ghiX) - Jq f{u,X)du)^ = 0{n-^h-^) + 0{h'^), where the O(-) terms are uni- 
formly in Xg [— 7T, 7t]; 

(ii) sup^g \ghiX) - /q f{u, X) du\ in probability. 

Proof. To prove (i), let 



and A„i(A) = J2T=i exp{— iAs}. Note that |A,„(A)| < 7tL,„(A), where the function L,n{uj) 
is the periodic extension of Lm{uj) = nl||ij|<„-i}(aj) + |w|^^l{„-i<|;j|<„}(a;); cf. Dahlhaus 
(1997). Using summation by parts, we get 



J2Mt/n, Afc)e-'(^-^'^)* = Ail, Afc)0(L„(Afc - A)) + o(sup \A'{u, Afe)|L„(Afc - A) 




n 




(6.1) 



0(L„(Afc-A)). 



Furthermore, because 



n 



^(At,„(A)- A(V?i,Afe))e 



i(A-Afc)t 



i=l 



n n 



< ^ \At,nW ~ A{t/n,X)\ + ^(A(i/n, A) - A(i/n, Afe))e-'(^-^^)* 




<nsnp\At,niX) - A{t/n,X)\+snp\A'iu,X)\\X~ XkWAniXk - X)\ 




<K-\l + \Xk~X\Ln{Xk~Xj), 
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-i(A-Ak)t 



t=l 



0(1). 



(6.2) 



By (6.1), (6.2) and the fact that /"^ i:„(A - A^) dA = 0(ln(n)) (cf. Dahlhaus (1997), 
Lemma A. 4), we get 



E{IniXk)) = cum(d„{\k)-,dn{-\k)) 

n 71 



1 r'^ _ _ 

- / EE^*."W^4:^e-'(^+^'='(*-^'dA 

-^^^ t=l s=l 

— / EE ^(^/"' Afe)e-'(^+^^)(*-^) dA + 0{Hn)n-^) 

J — 7T J. 1 1 



27m 
1 



/(M,Afc)du + 0(ln(?i)7i-i). 







Let uj = \j and A = Afc and consider 

Co?;(/„(A),/„(a;)) = citm(d„(a;),d„(A))cMm(d„(-a;),d„(-A)) 
+ cum{dn {L^)dn {~\))cum {dn{~Uj),dn{X)) 
+ cwm((i„(a;),d„(-a;),d„(A),d„(-A)). 

For w 7^ A, we have, for the first term in (6.3), 



(6.3) 



n n—s 

(d„(c^),d„(A))-— E E C'0«(^.,n,^..+fc,n)c-'"V 

s— 1 k——n-\-s 

^ n n—s 

— E E c(s/n,fc)e-'"'^e-'(^+")-^ + 0(n-i) 

s — 1 A:— — ri+s 
^ / n—ln—k 

— (EE^(^/->fc)e-^ 



cum 



2nn 



\k=a s=l 



from which we conclude, using 

— In — k 



iEE^(-V"'fc)^""^"^'""^^ ^^E 



fc=0 s=l 



fc=0 



E^- 



\{\-\-uj)s 



\c{s/n,k)\ 
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C 



n-l 



k—0 j — — oo 



< 



that cum 

(d„(w),d„(A)) =0(n-i/2) uniformly in Lu and A. Notice that the above argu- 
ments remain vahd if w (resp. A) is replaced by — w (resp. —A) or both. Furthermore, 

\cum{dniuj),dn{-uj),dniX),dni-X))\ 

71 n n n oo 
^""'^E E E E E \at„n{j)\\at,,n{j+t2-h)\ 

ti—1 ^2 — 1 Si — 1 S2 — Ij — — oo 

X |asi,„(j + si - ti)||as2,„(j + S2 - 

\j=_oo / 

Thus, Cow(/„(Afe),/„(Aj)) = 0{n^^) for Aj 7^ Afc, and for Xj = Afc, we get (along the 
same lines) that Var(/„(Aj)) = 0(1) uniformly in Xj. Using these results, the properties 
of the kernel K (see Assumption 2) and the smoothness properties of g (see Assumption 
1), it follows by standard arguments that E{gh{X)) = g{X) + 0(/i^) and Var{gh{X)) = 
0{n~^h~^) uniformly in A. 
To establish (ii), notice that 



9h{X) 



f{u,X)du 



< 



7 = -L„ L t=l \ 

- E ^'^(A-.)-E 
-E/f-'^)f- E ^'^(^— . 

t=i ^ ^ \ J=-L„ 



+ 0(n-i). 



The first term on the right-hand sight goes to zero because of part (i), Markov's inequality 
and a standard discretization argument (cf. the proof of Theorem Al of Franke and 
Hardle (1992)). Using |/(u, Ai) — j{u,X-i)\ < C\Xi — A2I, the second term is bounded by 
Cn~^ X]t=-L Kh{X — ujj)nh = 0{h). For the third term, we have, by Assumption 



(ii:..<.-.,-:)ii:/(Hhlfi:<^ 



A =0{n-'h-'). 



□ 
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Lemma 6.2. Let Assumption 1 be true and E\ei\'' < oo for k eN. Then 

cum(d„(Ai), d„(A2), . . . , d„{Xk)) = ^[^^ ^ A f -, Ai, A2, . . . , Afc_i j e~"^'=i ^' 

where the error term is uniform in Ai, A2, . . • , Afc and for u G [0, 1], 

Ai, A2, . . . , - A2) • • • Afc_i)^ A.^ 

is i/ie kth order local cumulant spectrum of the process {Xt{u),t G Z}, where Xt{u) = 
Ej°l-oo a("> fl"^ ^("' ^) = i) exp{-ij A} . 

Proof. We use 

cMm((i„(Ai),d„(A2), . . . ,d„(Afe)) 

,-i(Aa-LJs)t 



cumk[et) 



tu...,tk-' s=l 



and successively replace At^^n{i^s) by ^(^s/^, '^s) for 5 = 1,2, ...,fc— 1 and At^,n{^ X]s=i '^'^) 
by ^(ts/n, -X^s^r ^s)- We then get that the expression on the right-hand side above 
equals 

cum,{e,) ^ XlA{tJn,Xs)o--^i---''^' 

A (tk/n, - J2 As I e-'^^'^+^-J dwi • • • dc^fc_i 



ti....,tk s=l 



\ s=l / 

n k — 1 / k—1 



E n ^(V-' A.) A ( t/n, - j: a. ) e-'* , 



t=l s=l 



plus a remainder i?„ which, using bounds similar to (6.1) and (6.2) and Lemma A. 4 of 
Dahlhaus (1997). is shown to be, after tedious but straightforward calculations, bounded 

by 

Cons<.^^^|-^^ J Y]_Ln{Xs -ujs)du}iduj2--- diVk-i = 0(ln''"^(n)/n''/^). 
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We next show that under the assumptions of Theorem 3.1, replacing 5h(A) by 
g{X) = /(A) does not affect the asymptotic distribution of Tn- For this, write un- 
der the nuU hypothesis fh{X) for g/i(A) and let L„(u, A) — (/„i^ (u, A)//(A) — 1) and 
Gn(A) = (A(A)//(A) - 1). Further, let 



(6.4) 



where Vn{u,X) is defined in (2.11). We then have the following lemma. 

Lemma 6.3. Suppose the assumptions of Theorem 3.1 hold. As n-^oo, 

T„-f„ = op(m-i7V-i/26-i/2). 

Proof. Using a Taylor series argument and Lemma 6.1, we get 
rUnVbNTn 

X ( — ^/ffc(A-A0(L„(us,A0 + l)G„(A0)dA 

\ nh J 

where the first Op(-) term goes to zero for 5 < [1 — 2A)/(1 — A) and the second for 
5 < (8A — 1)/(1 — A). Denoting by — 2i?„ the second term on the right-hand side of the 
above equation, wc verify that 



rUnVN 

Rn + Op 



3=1''-^ j I V / 



N 
nbh 



with an obvious definition for i?„ and where Op{^jN/nbh) — > for (5 > A/(l — A). Evalu- 
ating the expectation £'(L„(ui, Ai)G„(A2)Ln('W2, A3)G„(A4)), wc can decompose E{RnY 
into a sum of several terms, two typical of which are 

^1." = J]^EE / /EEEE^^(^i-^^^)Wi-AOif.(A2-A,j 

" si S2 n /i j2 ii 
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X Kb{X2 - A/JdAidAs 



and 



Si S2 jl h j2 h 



X Kb{X2 - A/JdAidA2 

X £;[L„(it5, , Aj JG„ (A/, )]S[L„ {us2 , Xj^ )Gn {Xh )] • 



Using S(/™„(w, A,)) = /(A,) + 0(m-2), wc get 

= ^T.Y.f f T.Y.Y.Y.^'^(^^-^n)K,iXi-^iJKbi^2-X,2) 

X iir;,(A2 - AfJdAi dA2 

X Cov{I,n^ (us, , Xj, )//(Aj J, , Aj J//(Aj2 )) 

X £;[G„(AzJG„(AiJ] +0(7V6(n/i)-i). 

Now, standard calculations (of. Theorem 4.3.2 of Brillinger (1981)) yield 



Cov{Ijn„ (Usi , Xj),Ijn„ (w^a, A;)) 

^ r/(A,)/(AO 



47r2i/2„j0)- 

^^m„ (t^l — Xj)H„i^{ — UJi + Xi)H„i„{u}2 + Xj)Hjn^{—UJ2 — A;) 
+ j J H,n,A^l- ^j)H„i^{-UJi - Xi)H,n,^{uj2 + Xj)H,n„{-UJ2+ Xl) 

X e'^^'^i-'^^f^'i+^^Mwidcja 

/4K,C^2,W3)e'^(^^-^^^("^+"^)i/™„(wi +W2 + W3 + A,) 



(6.5) 



X Hra„{-t^l - Xj)Hyn^{-UJ2 + Xi)Hr,i^{-U>3 - A()da;i dw2 diOs 

+ O(log2(m„)m,;2) 

3 

Aj,Ai)+0(log^(TO„)m„2), 
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with an obvious definition for Cr,n{usi,Us2, Xj , Xi) , r = 1,2,3 and where /4(Ai, A2, A3) = 
{2'n)-^KiA{e-'^')A{e-'^')A{e-'^'^)A{e'^^'+^^+^''>), with A{z) =J2jaij)z^ , denotes the 
fourth order cumulant spectrum, and Hm^{X) = Hi^m„{X). Substituting (6.5) into 
-Ei_„, we get by straightforward calculation that Ei^n = 0{mnb{nh)~^ + rrinh'^h). 
To demonstrate this, consider, for instance, the term of Ei,n corresponding to 
Ci_„(wsi,Us2, Aji, AjJ. Using 



1 

2n 



ff„,„ {x - yi)H„,^ {~x + 2y2)e'^(^^-^^)" Ax 



(6.6) 



^ EE ' 

tl = lt2 = l 
tl-t2 = S(si-S2) 



1l 



^1(1/1*1-1/2*2) 



and E[Gn{Xi-^)Gn{Xi2)] = 0(n ^/i ^ + /i'*), by Lemma 6.1, we get that this term can be 
bounded by 



O ml 



1 

nh 

N N 

EE 

Si=l S2=l 



mlN 

^^/^fc(Ai-A,J/u(A2-A,g 



ji 32 



1 



tl = lt2 = l 
*l-t2=S(si-S2) 



-Aj2t2) 



dAi dA2 



0{mi[— + h 
nil 



E (1 - ^) / E ^'''(^1 - X,,)K,{X, - A,J 



|r|< 



31,32 



?n„+rSl(-c<r<0) 

^ ( m .. ) ^ 

t=l+rSl(0<r<c) 



t + rS" 



dAi dA2 



Approximating the sum over the Fourier frequencies by an integral and using the Lips- 
chitz continuity of the kernel K , we get that the above term is bounded by 



O ml 



1 



y J KbiXi-x)Kbi\2-x)dXidX2 
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m„+rSl(-c<r<0) / f \ 

/ ^ (, TO ) ^ 

t=l+rSl(0<r<c) ^ 



O (^ml (^-i- + /i^^ ^ 0{bH.^X,. (0)) / ^^'^(^1 ~ 2;)i^6(A2 - x) da; dAi dAa + i?M, 



O 



nh 



m„h% I + RMr 



where the remainder i?M„ satisfies 



|i?M„| < O to: 



X j Kb{\i- x)\y-x\A\i 



rn„+rSl{-c<r<0) 



^ ( m_. ) ^ 



t=l+rSl(0<r<c) 



1 



dx dy 



= 0{n-^h-^ + h^). 



\y~^\Ltix-y) dxdy 



The function L„i{uj) appearing above is the periodic extension of Lm{^) = '^l{|tj|<n-i}(w) + 
|w|~^l{„-i<|(^l<„}(a;) and the last equahty follows using Lemma A. 4 of Dahlhaus (1997). 
To bound i?2,n! we use 

E[L^{usA3)Gn{\i)]^r\>^j)r\h)cum{I,nSu,,\,)J{\i)) + 0{h^m-'') 

and cum{Irn„ius, Xj), I„ {Xr)) = 0(to^^^^7i^^/^) for Xj 7^ A,., where the O(-) term is uni- 
formly in Us, Xj and A^. Wc then have 

S J ^ 

X CUm{Ijn^{Us,Xj)Jn{K)) 

= 0(TOy26i/2Ari/2„"i/2) = o(&i/2) 
and, therefore, i?2,n = 0(6). □ 

Proof of Theorem 3.1. It suffices to show that to„-\/ NbTn — fJ-n converges to the 
desired Gaussian distribution. Using (6.5) and E{Ijn„ {us,Xj)) = f{Xj) + 0{m~^), we get 
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X Gov [u, , A,, ) , {u, , X,, )) + 0{N'^^m-^/^) 



/bN 



mnHl^ (0) 



X {|i^2,™„(A,, + A,Jp + |i72,™„(A,, - A,J|2 + i?4,™„(0)(77 - 3)} 



+ o(7vi/Vt'/') 

with an obvious definition for the i?i_„'s. For i?i,„, we have 



^0. 



4712^1, „ JO) 

where 0{N^^^b-^^^m-^) for 5 > 1/(3 - A) and 
rUnvbN 



Kb{v + 2x)Kb{v)\H2,m^ doj dvdx - J (K * K){y) dy 



Notice that the 0(Af^/^&~^/^TO~^) above appears because by the Lipschitz properties of 
K, the fact that |i?A:,m„(A)| < CLm„(A), where the function Lm„{X) has been introduced 
in the proof of Lemma 6.1, and by Lemma A. 4 of Dahlhaus (1997), we get 



Kb(v){Kb{v + 2a; + w) - Kbiv + 2x)) |iJ2.m„ (w)|^ duj dv dx 



= 0(7Vi/26-i/2^-i). 
To evaluate i?2.n we first write 

E2,n ^ 4 J^^y^o) I / / J Kt{v){Kbiv + uj)-Kbiv))\H2,„M\''dvdxdiu 
+ 2n f [K^{v)\H2^,nAuj)fdvduj}+0{N^/^b-^/^m-^) 



As for Ei^n, for the first term on the right-hand side, we get the bound 



Z /7 / ^"^Wd^ / MH2,mA^)\'dLo = 0{N'/%-'/^m~'), 
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K'{v/b)dv / |H2,™„Hrdw 



HWN 



K^{x)dx 



The proof that \m„ VbNE{Tn) — i-in \ — > concludes at this point because 

■0. 



('7-3) ^ (7/ -3) 



To obtain the variance of to„vWVT„, notice that 
1 



«l.S2il J2 ll,l2 

X {E[Ln {Us^ , Aji )Ln (lts2 , A;i )]E[Ln {Us^ , Aj2 )i„ , A/2 )] 

+ E[Ln{us^ , Aji )L„(us2 , A/2 )]£^[L„ (usi , A-,,, )Ln[us^ , A/^ )] (6.7) 

+ CUm[Ln{Us^ Aji), Ln{Usi , Xj^),Ln{Us^ , A; J, L„ (w^^ , A/a )] } 

+ o(m-2Ar-i5-i) 
= 14,„ + 1^2,„ + V^.n + o(m-2iV-i6-i) 

with an obvious definition for Vi^n, i = 1,2,3. Using Ln{u,Xj) ~ f~^{Xj){Im„iu, Xj) — 
(u,Aj))) + 0(m-2), we get 



N N 

Sl = l S2 = l 



EE^f(^i - A,)A',(A2 - xi)r\x,)r\xi) 



3 I 



X Cow(/™„(usi, Aj),/™„(ms2, A/)) 
+ 0(m-5fe-i). 

Substituting (6.5) into (6.8) and ignoring the 0(log^ (m„)m~^) term yields 

yi,n = ^/^4 EE/ / EE^^'''(-^l ^'^j)^''('^2-A/)Cg,„(Usi,Us2,Aj,A/) 
~ 3 

E E E [ [ EE^^VAi-A,)i^,(A2-A/) 



dAidAa (6.8) 



dAi dA2 



91 = 1 (j2=qi + l Si,S2 



j I 
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■ J I 

X Cg2,„(usi,Ms2, Aj, A;) dAidA2 



+ o{m-'N-'b-') 



1=1 



with an obvious definition for V^^^^ , ^ = 1, 2, . . . , 6. Using (6.6), we obtain 



*^ l.n 



X ^(l-|r|/7V) 



\r\<c 



3 I 



m„+rSl(-c<r<0) / , \ /, r. 



ti,t2 = l+rSl(0<r<c) 



-i(Aj-A,)(ti-t2) 



(6.9) 



dAi dA2 



wliicli, multiplied with m^Nb, converges, as ri — > oo, to 



mlNbV^ 



(1) 



/i2(x)/i2(a; + |s|/c) dx 



(i^*i^)2(2/)dy. 



-2n,2n] 



A repetition of the above arguments yields that m'^NhV-^'^jl converge to the same limit. 
For the term v/^^ notice first that using \Hrn„{uJi + cj2 + W3 + Aj) — H,n^{uji + Aj)| < 
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C\uj2 +W3I and Lemma A. 4 of Dahlhaus (1997), we get that 

/ /EE^^f(^i-^^)^^'''(^i-^')i^™"(^i + ^^- 
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This, together with straightforward calculations, yields m^TVbv/^^ = 0(6). For the cross- 
product terms, we have 

'"'n ^ l.n 

_ 2mlb 

l„l ^ „ J J "^71 I 

t=l+rSl{0<r<c) \ "/ \ 



\r\<c 



ji h 



-i(Ajj-A;Jti 



dAi dA2 



— T 



|,|<,V^[0,l-N|/c] 



h^(x)h^{x^\s\lc) dx 



{K*Kf{y)dy. 



Similar arguments show that m^NbVi^jl = 0(6) and m'^NhVi'l^ = 0(6) 
Combining the previous results for V^^l, ? = 1,2,...,6, we get that 



A6) 



mlNbVi^n 



""'^2 |.|<c^-^[0.1-NIAl 



h^{x)h^{x+\s\/c) dx 



(6.10) 



X ; {K*Kf{y)dy 

' [-27t,27t] 



as 71 — > 00. Since the structure of the term V2,n given in decomposition (6.7) is iden- 
tical to that of Vi,m 'm'^NbV2,n converges to the same limit as (6.10). The proof that 
Var (m„ V NbTn ) ) — J> is then completed because m'^NbVs^n — >■ as n — 00 ; see Papar- 
oditis (2006) for details. 

To show convergence of ninV Nb{Tn — ETn) to the desired Gaussian distribution, notice 
first that as in Theorem 10.3.1 of Brockwell and Davis (1991), we have (m, Aj)//(Aj) = 
Is{u,Xj)/a^ + Rm^{u,Xj), where sup, E{Rm„{u, = 0{m-^), (u, Aj) = a(e~^^) x 
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J,{u,X)Y^„{u,\) + a{e'^)Mu,-X)Y„,^{u,-\) + \Y,nAu,\)\'', J,{u,^ 

Eraji 1 —iXt 



and 



Let 



and 



t=l 

= :<!.,KA) + C/i'i,KA). 

L„,e(w, A) = ^ i^fa(A - A,)(/,(u, Aj) - 1) 



(6.11) 



Ty„(w, A) = m,;i ^ i^f,(A - A,)i?™„ (w, A,). 

We then have m„^/]Vfe(^„ - ET,,) = N-^l"^ ^f^^ Z^,„ + i?i,„ + 2i?2,n, where 
Zs-n =m„Vb J {LI^^{us,X) - ELl^^{us,\))d\, 



s=l 

and 

TV 



i?i,„ = ^^y'(iy2(^^,A)-W„2K,A))dA 



R2,n - ^ / {Wn{Us, A)L„,,(?/„ A) - S( W„ ^ , A)i„.e (lis , A))) dA. 

Observe that {Zs n, s = 1, 2, . . .} forms a zero-mean, strictly stationary c-dependent se- 
quence since for |.si — S2I > c, the random variables Zs-^^^n and Zs2.n are based on the 
two non-overlapping sets of i.i.d. random variables {£(+[k^^„]_j\/^„i, t = 1, 2, . . . , m„} 

and {ei+[ti^^„]„7\/^_i, Z = 1, 2, . . . , TO„} respectively. Furthermore, for Var(Tn), wc get, 
for |si — S2I < c, 

\ 2 



{K*K){y)dy. 

-2n 
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Convergence of -^"^^^X^fLi ^s,n to the desired Gaussian distribution then follows by a 
CLT for c-dependent sequences; see Theorem 6.4.2 of Brockwell and Davis (1991). 

The proof of the theorem is completed because straightforward but cumbersome cal- 
culations show that Ri_n and i?2,n — > in probability; see Paparoditis (2006). □ 

Proof of Theorem 4.1. Let y(u, A) = /„„ (u, A)/.g(A) - 1, G{u, X) = {f{u, X)/g{X) - 1) 
and write r„ as 

iv 1 ^ /• 

^" = ]^E iV^i^s,X)-V^{u,,X))dX + -Y, {V'{us,X)-G'ius,X))dX 

3=1'' 3 = 1'' 

N 

+ nY. G\u,,X)dX. 

3=1 '' 

The assertion of the theorem follows because N^^^^^^ j {V^{us,X) — V^{us,X))(iX < 
Op(l)/"^ |g/i(A) - .g(A)| dA, J^^\gh{X) - g{X)\dX in probability (van der Vaart 
(1998), Proposition 2.29) and N-^J2f=i Iiy^iu,,X) ~ G^iu,,X))dX <Op{rn). □ 

Proof of Theorem 4.2. Using the notation L„(u,A) = (/,„^ (u, A)//(w, A) — 1) and 
G„(u,A) = (/(u, A)/g/i(A)-l) and substituting (it, Aj)/5/i(Aj)-l ^ Ln{u,Xj)f{u,Xj)/ 
?/t(Aj) + G„(u,Aj), wc get 

+ /"|^A'b(A-A,)i„(M„A,)G„(w„A,)| dA 

+ ^E/{E^'^(^-^.)G"(«3,A,)}'dA 

2 ^ r 

+ ^EEE / AVA-A,Jif,(A-A,JdA 

X {Ln{Us, Ajj )Ln{Us, Xj^)Gn{Us, Xj^ ) + i„(Us, Xj-^ )G„(Ws, Xj^ ) 

+ G„(us, AjjL„(us, AjJG„(ms, AjJ} 

6 
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with an obvious definition for the Ti^„'s. To establish the theorem, we show tliat, in 
probability, 

/AT /"TT 

Vm„A^Ti.„ - Mh^=t / K^{x) dx ^ 0, (6.12) 
y/mnNT2,n-MH^^7 I K\x)dx^ f f ( - l) dXdu ^ 0, (6.13) 

27tJo J-n\ gW ) 



y/m,,NTi^^ 0, (6.14) 



where AIh = H4/H2, and that 



^rUnNin.n + T5,„ + Te.n -Dl)^ AA(0, w^). 



(6.15) 



To sec (6.12), note that ^J^^^^E{Ti^n) = MnN^'^m'^'^h-^ ^'^^K^{u)du + 0{N^/^m~^'^). 
Furthermore, if Assumption 1 is satisfied, then /„j^ (it. A) = Im^ {"^i ^) + 0('™n'^^^)7 where 



/„„(u,A) = (27Tff2.,n„(0))- 



^t,>n„-^t+[Mri]-M„-l,ri exp{-iAt} 



is the tapered local periodogram of the scries Xi{u),X2{u), . . . , X„(w), where Xt{u) = 
a{u,j)et-j. By Assumption 6 and simple algebra, we get y/rUnNTi^n = aA"?JV7i_„ + 
op(l), where Tin is obtained by replacing Im„{u,X) by Im„{u,X) in Ti^„. Now, 
Var(fi,„) = 0(m-2Ar-i5-i)^ which implies that Var{^/mJ^fi^n) 0(m,7i6-i) ^ as 
n — >■ 00. 



To establish (6.13), we use f/gh — ^ = if / g — ^) — {g /dh — ^) f / g and write \/mnNT2 „ 
.(1) 

2,n 



yfii^T^^l + i?2,„, where 



b(A - Aj)L„(u5, Aj)G(us, Aj) ^ dA 



with G{u, A) = (/(u, ^)/ g{^) ^ 1) and the remainder i?2,Ti converges to zero in probability 
by Lemma 6.1 and Assumption 6 since it is of order OimU"^ N^/'^h'^ + N^/'^{nbh)~^/'^ + 
nhr!'^ N^/'^lP'h?). Now, arguing exactly as in the proof of assertion (6.12), it follows that 



5(A) 



1 dAdu-^0 



and Var{y/7K:^T^2) = 0{m-^b-^) as n ^ cxd. 



Testing temporal constancy 1219 



To show that (6.14) is true, we use the same decomposition for f fq — 1 as in (6.13) 

.(1) 

4,n 



and verify that \/ rrinNTi^n = ^/^TlnNT^^ + i?4^„, where 



2 ^ r 

V n s=l ji j2 



X Ln{Us, Xj^ )Ln{Us, Xj^ )G{Us,Xj^ ) dA, 



and, for i?2.n, the remainder i?4_„ satisfies, by Lemma 6.1 and Assumption 6, 
i?4,„ = 0{ml/^N^/^h^ + N^/^{nbh)-^/^ + ml/^ N^/%^h^) ^ as 7W oo. Now, as for 
V^TO^JlVT2^,j, the variance of rrinNT^'n is of order 0(to~^&~^), while 



i?(V^T(i,!) = -7^ / K\x)Ax^Y. I G(u.,A)dA + 0(7Vi/2,^,;V2) 
= 0(m-i/27V-i/2) + 0(7Vi/2m^i/2) g, 

by Assumption 6, where the last equality follows because G{u, A) du = 0. 

It remains to establish assertion (6.15). For this, consider the sequence of bivariate 
random variables (T^^n ~ D^,Tc, „ + 76,n)'- Straightforward calculations yield 



y = VmJ^{Jl,n,J2.ny +Op{l) (6.16) 

(see Paparoditis (2006)), where 

Ji,„ = -2 / «(A)(/„(A) - 5(A)) dA, (6.17) 



N 



J2.n^^J2 I w{Us,X){I^^{Us,X)- f{u,,\))d\, (6.18) 

s=l 

viX) = f{u, A)[/(u, A)/,g(A) - l]/ff2(A) du and w{u, A) = [/(u, A)/5(A) - l]/g(A). 
Using Lemma 6.2. we get irinN Var{Ji^n) ~^Vi a.s oo, where 

yi = 167t J v\\)g\\)dX + 8n J J i-(Ai)w(A2)54(Ai, -Ai, A2) dAi dAa 

and g4(Ai, — Ai, A2) = /J^ /4(w, Ai, — Ai, A2) dw. Furthermore, 
mnNVar{J2,n) 

4m P f ^ ^ 

^ ^w{us,,uj)w{us2,X) 

si — 1 S2 — I 

X {cM77i(d„(us^, w), d„(us2 A))cwm(d„(us^ , -w), d„{us2 , - A)) 
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+ cum{dn [usi , w)d„(its2 , ~X))cum{dn{usi , -w), d„(us2 , A)) 
+ cwm((i„(usj,w),d„(us^, A),(i„(us2, -A))}du;dA 

= Wi^n + W2,n + W3,n 

and an obvious definition for Wi^m « = 1,2,3. Analyzing eacli term separately, we get 

Win^Wi = ^y^ I h^[x)h'^[x+\s\lc)dx I [ w'^{u,X)dXdu, 
^2 ^^J[os-i^] Jo J 

where w(u,X) ~w{u,X)f{u,X). W2,n has the same limit as Wi,„, while 

W3n^W3 = ^y f h\x)h^{x+\s\/c)dx r wiu,X)dx] du. 



Finally, since 

N 



s=l 



4m„y^ / / -(;(Ai)w(Ms,A2)cum(d„(Ai),(im„(its,A2))cM77i((i„(-Ai),d,„„(u5,-A2))dAidA2 



4™, ^ 



' s—l 



[«sn] + A/„ + l 

X ^t-[tisn] + Af„-l,m„'l/-[tian]+A/„-l,m„ 

t,;=[«an]-A/„+2 



X a(-,x,)a(-,X2)a(-,-x,)a(-,-X2 

xe-HAi+A.)(t-0dA<iA2 

+ 0(l0g(TO„)/m-n) 

^C[^l^8nj^ J v{X)w{u,X)f{u,X)dXdu 

and 

4m„^ / / w(Ai)w(us,A2)cMm((i„(Ai),d„(-Ai),d„„(us,A2),dm„(us,-A2))dAidA2 
s=i 

/ / ^'(Ai)w;(u5,A2) 



ni/2,m„(0) ^ 
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[usn]+A/„ + l 
t=lusn]~M„+2 



"■t-lu,n]+M„ 



A -,Ai 
n 



Ai -,\2 
n 



+ 0(log(m„)/m„ 



^ ^1.2 



:87t 



v{Xi)w{u, X2)f{u, Ai)/(u, Ai) dAi dA2 du, 



we get rUnN Cov{Ji,n, J2,n) Ci_2 = — 2C;[^2 ~ ^"^^2. Using Lemma 6.2 and arguing as 
in the proof of Theorem 5.10.1 of Brillinger (1981), we get that y/ninNJi^n =>A/'(0, Vi) 
as n oo. Furthermore, by Theorem A. 2 of Dahlhaus (1997), we get v^m^JVJ2,n 
Af{0,W), where W = 2Wi + W3. An apphcation of the Cramer- Wold device leads to 
Vm„iV(Ji,„, J2,„)' ^A/'((0,0)',F), where V = {vr,s)r,s=i.,2 with vi^i^Vi, V2.2 = W and 
vi^2 = ^^1,2- Approximation (6.16) then yields assertion (6.15) with — vi^i + V2.2 + 
2t;'i,2. ' ' □ 
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